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Abstract. In the context of strategic games, we provide an axiomatic 
proof of the statement 

(Imp) Common knowledge of rationality implies that the players will 
choose only strategies that survive the iterated elimination of strictly 
dominated strategies. 

Rationality here means playing only strategies one believes to be best 
responses. This involves looking at two formal languages. One, Co, is 
first-order, and is used to formalise optimality conditions, like avoiding 
strictly dominated strategies, or playing a best response. The other, C v , 
is a modal fixpoint language with expressions for optimality, rationality 
and belief. Fixpoints are used to form expressions for common belief and 
for iterated elimination of non-optimal strategies. 



1 Introduction 

There are two main sorts of solution concepts for strategic games: equi- 
librium concepts and what might be called "effective" concepts. One in- 
terpretation of the equilibrium concepts, for example Nash equilibrium, 
tacitly presupposes that a game is played repeatedly (see, e.g. [13j P a ge 
14]). Thus the standard condition for Nash equilibrium in terms of the 
knowledge or beliefs of the players [3] - the so-called "epistemic analysis" 
of Nash equilibrium - includes a requirement that players know the other 
players' strategy choices. 
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Fig. 1. Two strategic games 



Consider the left-hand game in Figure [TJ in which each player has two 
choices L and R and both players get payoff of 1 if they coordinate, and 

otherwise. Then there are two Nash equilibrist both play L or both 
play R. But this does not translate by itself into an effective strategy for 
either player reasoning in isolation, without some exogenous information. 

In contrast, effective solution concepts, for example the iterated elim- 
ination of strictly dominated strategies, are compatible with such a "one- 
shot" interpretation of the game. Thus the epistemic analysis of the it- 
erated elimination of strictly dominated strategies does not require that 
the players know each other's strategy choice. 

A strategy is strictly dominated if there is an alternative strategy 
ti such that no matter what the opponent does, ti is (strictly) better for 

1 than s, . Say that a player is sd-rational if he never plays a strategy that 
he believes to be strictly dominated. What the iterated elimination of 
strictly dominated strategies does in general require, see jl], is then that 
players have common true belief that each other is rational, that is: they 
are rational, believe that all are rational, believe that all believe that all 
are rational, etc. 

In the right-hand game in Figure (TJ the column player, on first look- 
ing at her choices L or R is, superficially, in the same situation as be- 
fore: choose L and risk the opponent playing D or choice R and risk the 
opponent playing U. However, this time the row player can immediately 
dismiss playing D on the grounds that U will always be better, no matter 
what the column player does. So if the column player knows (or believes) 
this, then he cannot rationally play R, and so must play L. 

In this paper we study the logical form of epistemic characterisation 
results of this second kind, so we give formal proof-theoretic principles 
to justify some given effective or algorithmic process in terms of common 
belief of some form of rationality. We will introduce two formal languages. 
One, Co, is a first-order language, that can be used to define 'optimality 
conditions'. Avoiding playing a strictly dominated strategy is an example 
of an 'optimality condition'. Another one is choosing a best response. 

However, as observed in [2] for all such notions there are two versions: 
'local' and 'global'. Notice that in our informal description of when is 
strictly dominated by tj we did not specify where i is allowed to choose 
alternative strategies from. In particular, since we are thinking of an it- 
erated procedure, if ti has been eliminated already then it would seem 
unreasonable to say that i should consider it. That intuition yields the 

1 A Nash equilibrium in a two-player game is a pair (si, S2) of strategies, one for each 
player such that si is a best response to S2 and vice-versa. 
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local definition; the global definition states the opposite: that player i 
should always consider his original strategy set from the full game when 
looking to see if a strategy is dominated. 

A motivation for looking at global versions of optimality notions is 
that they are often mathematically better behaved. On finite games the 
iterations for various local and global versions coincide pQ , but on infinite 
games they can differ. In a nutshell: an optimality condition (pi for player 
i is global if i does not 'forget', during the iterated elimination process, 
what strategies he has available in the whole game. The distinction is 
clarified in the respective definitions in Co- 

An optimality condition <p induces an optimality operator on the 
complete lattice of restrictions (roughly: the subgames) of a given game. 
Eliminating non-0-optimal strategies can be seen as the calculation of a 
fixpoint of the corresponding operator 0$. Furthermore, common belief is 
characterised as a fixpoint (cf . Note [3] below) . Viewed from the appropri- 
ate level of abstraction, in terms of fixpoints of operators, this connection 
between common belief of rationality and the iterated elimination of non- 
optimal strategies becomes clear. 

We define a language C v that describes things from this higher level 
of abstraction. Each optimality condition defines a corresponding notion 
of rationality, which means playing a strategy that one believes to be <j)- 
optimal. C v is a modal fixpoint language with modalities for belief and 
optimality, and so can express connections between optimality, rationality 
and (common) belief. 

We say that an operator O on an arbitrary lattice (D, C) is mono- 
tonic when for all A,B G D, if A C B then 0(A) C O(B). The global 
versions of relevant optimality operators, in particular of the operators 
corresponding to the best response and strict dominance, are monotonic. 
This is immediately verifiable in Co by observing that the relevant defi- 
nition is positive. 

Our first result is a syntactic proof of the following result, where 4> is 
a monotonic optimality condition!! 

Theorem 1 Common true belief of 4>-rationality entails all played strate- 
gies survive the iterated elimination of non-cj)- optimal strategies. 

Although this theorem relies on a rule for fixpoint calculi that is only 
sound for monotonic operators, the semantics of the language C v allows 
also for arbitrary contracting operators, i.e. such that for all A, O(A) C 

2 By "common true belief" we mean a common belief that is correct. In particular, 
common knowledge entails common true belief. 
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A. We are therefore able to look at what more is needed in order to justify 
the following statement (cf. [H Proposition 3.10]), where g&r-rationality 
means avoiding avoiding strategies one believes to be never best responses 
in the global sense: 

Theorem 2 (Imp) Common true belief of gbr -rationality implies that 
the players will choose only strategies that survive the iterated elimination 
of strictly dominated strategies. 

This theorem connects a global notion of g&r-rationality with a local 
one, referred to in the iterated elimination operator. Our language allows 
for arbitrary contracting operators, and their fixpoints to be formed, and 
we exhibit one sound rule connecting the resulting fixpoints with mono- 
tonic fixpoints. 

Our theorems hold for arbitrary games, and the resulting potentially 
transfinite iterations of the elimination process. The syntactic approach 
clarifies the logical underpinnings of the epistemic analysis. It shows that 
the use of transfinite iterations can be naturally captured in C u , at least 
when the relevant operators are monotonic, by a single inference rule that 
involves greatest fixpoints. 

The relevance of monotonicity in the context of epistemic analysis 
of finite strategic games has already been pointed out in [5], where the 
connection is also noted between the iterated elimination of non-optimal 
strategies and the calculation of the fixpoint of the corresponding opera- 
tor. 

To our knowledge, although several languages have been suggested for 
reasoning about strategic games (e.g. [7]), none use explicit fixpoints (ex- 
cept, as we mentioned, for some suggestions in [5]) and none use arbitrary 
optimality operators. 

Therefore they are not appropriate for reasoning at the level of ab- 
straction that we suggest when studying the epistemic foundations of 
these "effective" solution concepts. For example while Section 13] does 
provide some analysis of the logical form of the argument that common 
knowledge of one kind of rationality implies not playing strategies that 
are strictly dominated, the fixpoint reasoning is done at the meta-level. 
What [7J provides is a proof schema, that shows how, for any finite game, 
and any natural number n, to give a proof that common knowledge of 
rationality entails not playing strategies that are eliminated in n rounds 
of elimination of non-optimal strategies. 

The more general and elegant reasoning principle is captured by using 
fixpoint operators and optimality operators. Another important advan- 
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tage to our approach is that we are not restricted in our analysis to finite 
games. This means in particular that our logical analysis covers the mixed 
extension of any finite game. 

Our use of transfinite iterations is motivated by the original finding 
of |12] , where a two-player game is constructed for which the ujq (the first 
infinite ordinal) and loq + 1 iterations of the rationalizability operator of 
[6] differ. 

2 Games and the language Co 

A strategic game is a tuple (T\, . . . , T n , <\, . . . , < n ), where {1, . . . , n} 
are the players and each Tj is player i's set of strategies, and <j is player i's 
preference relation, which is a total linear order over the set of strategy 
profiles T = fllLi ^i- Note that we assume arbitrary games, rather than 
restricting to games in which T is finite. To depict games it is sometimes 
easier, as we did in Figure [TJ to write down a number for the players' 
"payoffs", rather than just a preference ordering. We use some standard 
notation from game theory, writing s_j for (si, . . . Sj_i, Sj+i, . . . s n ) and 
(sj, t-i) for the strategy profile (t\, . . . U—lj s ij U+i, ■ ■ ■ s n), as well as S-i 
for Ylj-ti Sj. A restriction of the game (T±, . . . , T n , <i, . . . , < n ) is a se- 
quence S = (S\, . . . , S n ) with Si C Tj for all players i, i.e. a (possibly 
empty) subgame in which the payoff information is left out. 

The language we use for specifying optimality conditions is a first- 
order language, with variables V = {x, y, z, . . .}, a monadic predicate C, 
a constant o and a family of n ternary relation symbols ■ >? ■, where 
% € [l..n]. So £o is given by the following inductive definition: 

cf) :: = C(a) \ a > l c b \ -*/> \ <f> A (f> \ 3x(f>, 

where i S [l..n] and {a, b, c} C 1/ U {o}. 

We use the standard abbreviations — > and V, further abbreviate -ia >J. 
6 to b > l c a, Vx0 to — i3oj— i0, 3x(C(x) A 0) to 3x G C 0, and Vx(C(x) -> 0) 
to Vx G C 0. 

An optimality model (G, G' , s) is a triple consisting of a strategic game 
G = (Ti, . . . , T n , <i, . . . , < n ), a restriction G' of G, and a strategy profile 
s G T. G will be used to interpret the predicate C, and s will be the 
interpretation of o. An assignment for (G, G, s) is a function a assigning 
a strategy profile in T to each variable, and s to o. The ternary satisfaction 
relation |= between optimality models, assignments and formulas of Co 
is defined inductively as follows, where a is an assignment for (G, G',s), 
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and \/= the complement of |=: 



(G, G', s) K C{x) & Vz G {1, . . . , n}, (a(x))i G G- 
(G, G',s) \= a x>\y ^ (a(x)i,a(z)-i) >, (a(y)i, a(z)-i) 
(G,G',s)\= a ^ &(G,G',s) ^ Q 
(G, G', s) \= a 0i A 2 o (G, G', s) K 0i and (G, G', a) K 2 
(G, G', s) \= a 3x(j) <^ there is a' : (G, G', s) \= a > and 

Vy G V with x ^ y, a(y) = a'(y) 

If for any assignment a for G we have (G, G',s) \= a then we write 
(G, G', s) (= 0. A variable x occurs free in if it is not under the scope 
of a quantifier 3x; a formula is closed if it has no free variables. 

An optimality condition for player i is a closed £o-f°rmula m which 
all the occurrences of the atomic formulas a >i b are with j equal to i. 
Intuitively, an optimality condition 0j for player i is a way of specifying 
what it means for i's strategy in o to be an 'OK' choice for i given that 
i's opponents will play according to C-i and that i's alternatives are C%. 

In particular, we are interested in the following optimality conditions: 

- Isdi := Vy G C3z G Co >\ y, 

- gsdi := Vy 3z G Co >\ y, 

- g&rj := 3z G G Vy o > l z y. 

The optimality conditions listed define some fundamental notions 
from game theory: Isdi says that Oj is not locally strictly dominated in 
the context of G; gsdi says that Oj is not globally strictly dominated in 
the context of G; and g&rj says that Oj is globally a best response in the 
context of G. 

The distinction between local and global properties, studied further in 
|2j, is clarified below. It important for us here because the global versions, 
in contrast to the local ones, satisfy a syntactic property to be defined 
shortly. 

First, as an illustration of the difference between gbri and gsdi, con- 
sider the game in Figure EJ Call that game H, with the row player 1 and 
the column player 2. Then we have 

(H, (T 1 ,T 2 ),(D,R))\=gsd 1 , 

but 

(H, (Ti,T 2 ), (D,R)) \=^gb n . 
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Fig. 2. An illustration of the difference between strict dominance and 
best response 

The local notions are such that when the 'context' restriction C consists 
of a singleton strategy for a player i, then that strategy is locally optimal. 

50 for example 

(H,({U,M},{R}),(U,R)) ^lsd 2 , 

whereas 

(H,({U,M},{R}),(U,R))\=^gsd 2 . 

We say that an optimality condition <pi is positive when any sub- 
formula of the form C(z), with z any variable, occurs under the scope of 
an even number of negation signs (->). Note that both gbri and gsdi are 
positive, while Isdi is not. As we will see in a moment, positive optimality 
conditions induce monotonic optimality operators, and monotonicity will 
be the condition required of optimality operators in Theorem Q] relating 
common knowledge of (^-rationality with the iterated elimination of non-0 
strategies. 

3 Optimality operators 

Henceforth let G = (Ti, . . . , T n , <i, . . . , < n ) be a fixed strategic game. Re- 
call that a restriction of the game G is a sequence S = (S\, . . . , S n ) with 

51 C Tj for all players i. We will interpret optimality conditions as opera- 
tors on the lattice of the restrictions of a game ordered by component-wise 
set inclusion: 

(Si, ■ ■ ■ , S n ) C (S[, . . . , S' n ) iff S t C S'i for all i G [l..n]. 

Given a sequence <j> giving an optimality condition (pi for each player 
i, we introduce an optimality operator defined by 

n 

<t> (S) = l[{s l eS i | &( 8i ,S).} 

i=l 
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Consider now an operator O on an arbitrary complete lattice (D, C) 
with largest element T. We say that an element S € D is a fixpoint of 
O if S = 0{S) and a post- fixpoint of O if 5 C 0(S). 

We define by transfinite induction a sequence of elements O a of D, 
for all ordinals a: 

- 0° := T, 

- O a+l := 0(O a ), 

- for limit ordinals /3, O 13 := floK/?^"- 

We call the least a such that O a+1 = O a the closure ordinal of O and 
denote it by ao- We call then O a ° the outcome of (iterating) O and 
write it alternatively as 0°° . 

Not all operators have fixpoints, but the monotonic and contracting 
ones (already defined in the introduction) do: 

Note 1. Consider an operator O on (D, C). 

(i) If O is contracting or monotonic, then it has an outcome, i.e., 0°° is 
well-defined. 

(ii) The operator O defined by O(X) := O(X) n X is contracting. 

(iii) If O is monotonic, then the outcomes of O and O coincide. 

Proof. For (i) , it is enough to know that for every set D there is an ordinal 
a such that there is no injective function from a to D. 

Note that the operators are by definition contracting, and hence 
all have outcomes. Furthermore, it is straightforward to verify that if fa 
is positive for all players i, then is monotonic. 

The following classic result due to [13] also forms the basis of the 
soundness of some part of the proof systems we consider H 

Tarski's Fixpoint Theorem For every monotonic operator O on (D, C) 
0°° = uO = U{S g D | S C 0(5)}, 

where vO is the largest fixpoint of O. 

We shall need the following lemma, which is crucial in connecting 
iterations of arbitrary contracting operators with those of monotonic op- 
erators. It also ensures the soundness of one of the proof rules we will 
introduce. 

3 We use here its 'dual' version in which the iterations start at the largest and not at 
the least element of a complete lattice. 
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Lemma 1. Consider two operators 0\ and O2 on (D, C) such that 

- for all S eD, Oi(S) C 2 (S), 

— 0\ is monotonia. 

Then Of C Ch°°. 

Proof. By Note[T](i) the outcomes of 0\ and O2 exist. 
We prove now by transfinite induction that for all a 

or cor 

from which the claim follows, since by Note[]Jm) we have Of = 02°°- 

By the definition of the iterations we only need to consider the in- 
duction step for a successor ordinal. So suppose the claim holds for some 
a. 

The second assumption implies that 0\ is monotonic. We have the 
following string of inclusions and equalities, where the first inclusion holds 
by the induction hypothesis and monotonicity of Oi and the second one 
by the first assumption 

oT +1 = ol{o[ a ) c oT(oT) = Oi(oT)noT c o 2 (oT)noT = oT +1 



4 Beliefs and the modal fixpoint language C v 

Recall that G is a game (Ti, . . . , T n , P\, . . . , P n ). A belief model for G is 
a tuple (J7, Si, . . . ,s n , Pi, ... , P n ), with Q a non-empty set of 'states', and 
for each player i, Sj : .!? — > Tj and Pj : J7 — >• 2 . The Pj's are possibility 
correspondences cf. The idea of a possibility correspondence Pj is 
that if the actual state is uj then Pi(ui) is the set of states that i considers 
possible: those that i considers might be the actual state. 

Subsets of Q are called events. A player i believes an event E if that 
event holds in every state that i considers possible. Thus at the state lj, 
player i believes E iff Pi{u) C E. 

Given some event E we write Ge to denote the restriction of G de- 
termined by E: 

{Ge)i = {si S Ti I 3u € E : Si(u) = sj. 

In the rest of this section we present a formal language C u that will be 
interpreted over belief models. To begin, we consider the simpler language 
C, the formulas of which are defined inductively as follows, where i £ 
[l..n\: 
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ip ::= ratfa | ip A ip \ ->ip \ \ O^ip, 

with 4>i an optimality condition for player i. We abbreviate the formula 

Ai 6 [i..n] rat ^ to ra ^> Aie[i..n] D ^ to DV> and A ie [i..n] °<t>^ to °<^- 

Formulas of C are interpreted as events in (i.e. as subsets of the domain 
of) belief models. Given a belief model (f2,si, . . . ,s n , Pi, ... , P n ) for G, 
we define the interpretation function [■]:£—> P(i?) as follows: 

- [rafyj = G | 0i(si(a;),Gp.( w ))}, 

- [M^] = Wni, 

- hv] = ^-M, 

- \0<i>M ={wefi (G,G M ,Si(6j)) |= 0,}. 

Pi(w) gives the set of states that i considers possible at u, so [rat^J 
is the event that player i is ^-rational, since it means that i's strategy is 
optimal according to <pi in the context that the player considers it possible 
that he is in. The semantic clause for was mentioned at the begin of 
this section and is familiar from epistemic logic: pit/?] is the event that 
player i believes the event [t/;]. [0</ li V'] is the event that player i's strategy 
is optimal according to the optimality condition fa , in the context of the 
restriction Gm. 

Then clearly [rat^J is the event that every player i is ^-rational; 
IO</,?/>] is the event that every player's strategy is ^-optimal in the context 
of the restriction Gkm ; and [[□V'] is the event that every player believes 
the event [[?/>] to hold. 

Although £ can express some connections between our formal defi- 
nitions of optimality rationality and beliefs, it could be made more ex- 
pressive. The language could be extended with, for example, atoms Sj 
expressing the event that the strategy Sj is chosen. This choice is made 
for example in [7], where modal languages for reasoning about games are 
defined. The language we introduce is not parametrised by the game, and 
consequently can unproblematically be used to reason about games with 
arbitrary strategy sets. 

We will use our language to talk about fixpoint notions: common 
belief and iterated elimination of non-optimal strategies. Let us therefore 
explain what is meant by common belief . Common belief of an event E 
is the event that all players believe E, all players believe that they believe 
E, all players believe that they believe that. . . , and so on. Formally, we 
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define CB(E), the event that E is commonly believed, inductively: 



Bi(E) = {ooen\\/i£ [l..n], Pi(u) C E} 
B m+1 (E) = B 1 (B m (E)) 
CB(E) = f| B m {E) 

Notice that B\{E) is the event that everybody believes that E (indeed, 
we have B\ \}\)\ = [□V'J ) , B2 {E) is the event that everybody believes that 
everybody believes that E, etc. 

'Common belief is called 'common knowledge' when for all players i 
and all states uj € i?, we have to € Pi(co). In such a case the players have 
never ruled out the current state, and so it is legitimate to interpret diip 
as l i knows that 

Both common knowledge and common belief are known to have equiv- 
alent characterisations as fixpoints, and we will exploit this below in defin- 
ing them in the modal fixpoint language which we now specify. 

We extend the vocabulary of C with a single set variable denoted by X 
and the contracting fixpoint operator vX. (The corresponding extension 
of first-order logic by the dual, inflationary fixpoint operator [iX was first 
studied in [8].) Modulo one caveat the resulting language C v is defined as 
follows: 

tp ::= ratfa \ (ip A tp) \ ->tp \ 0$ | Ofcip | i/X.i/j 
The caveat is the following: 

— <j) must be v-free, which means that it does not contain any occur- 
rences of the vX operator. 

This restriction is not necessary but simplifies matters and is sufficient 
for our considerations. 

To extend the interpretation function [■] to £„, we must keep track 
of the variable X. Therefore we first extend the function [■]:£—>■ V(f2) 
to a function [• | •] : C v x V(f2) — > V{Q) by padding it with a dummy 
argument. We give one clause as an example: 

- [Q t V I E\ = {u e Q I Pi(u) C fip I Ej}. 

We use this extra argument in the semantic clause for the variable X: 
-{X\E]=E. 

Those formulas whose semantics we have so far given define operators. 
More specifically, for each of them \ip \ •] is an operator on the powerset 
V(f2) of fl. We use this to define the clause for vX: 



11 



- [uX.il>\m = (bl>AX\-l)°°- 

When X does not occur free in ip, we have \ip \ Ej = \ip \ Fj for any events 
E and F, so in these cases we can write simply [t/>]. Note that Jz^X.^J is 
well-defined since for all E we have ftp A X \ Ej = \ip \ Ej n \X | Ej C JS, 
so the operator [[?/> A X | •] is contracting. 

We say that a formula ip of is positive in X when each occurrence 
of X in ip is under the scope of an even number of negation signs (-1), 
and under the scope of an optimality operator Ofc only if (pi is positive. 

Note 2. When ip is positive, the operator \ip | •] is monotonic. 

Then by Tarski's Fixpoint Theorem and Note []Jm) we can use the fol- 
lowing alternative definition of [i/X.^J in terms of post-fixpoints: 

\uX4\ = \J{E Cfi|i?C^| Ej}. 

Let us mention some properties the language C u can express. First 
notice that common belief is definable in L v using the vX operator. An 
analogous characterization of common knowledge is in Section 11.5]. 

Note 3. Let ip be a formula of C. Then [i/X.D(X A ip)j is the event that 
the event §ipj is common belief. 

From now on we abbreviate the formula vX.\3(X A ip) with ip a for- 
mula of C to U*ip. So C u can define common belief. Moreover, as the 
following observation shows, it can also define the iterated elimination of 
non-optimal strategies. 

Note 4- In the game determined by the event lyX.O^Xj, every player 
selects a strategy which survives the iterated elimination of non-^-optimal 
strategies. 

Proof. It follows immediately from the following equivalence, which is 
obtained by unpacking the relevant definitions: 

5 Proof Systems 

Consider the following formula: 

(rat^ A 0*rat<i,) ->• vX.O^X. (1) 
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By Notes [3] and HI we see that ([T]) states that: true common belief that 
the players are (^-rational entails that each player selects a strategy that 
survives the iterated elimination of non-(/>-optimal strategies. 

In the rest of this section we will discuss a simple proof system in 
which we can derive ([I]). We will use an axiom and rule of inference 
for the fixpoint operator taken from [TT] and one axiom for rationality 
analogous to the one called in [7] an "implicit definition" of rationality. 
We give these in Figure El where, crucially, ip is positive in X , and all the 
(pi 's are positive. We denote here by ip[X i-> x] the formula obtained from 
ip by substituting each occurrence of the variable X with the formula X- 
Assuming given some standard proof rules for propositional reasoning, we 
add the axioms and rule given in Figure [3] to obtain the system P. 



Axiom schemata 
rat^ — > (Ox — > Ocpx) ratDis 
vX.ip -> ip[X M> uX.ip] uDis 

Rule of inference 
X -»■ ip[X i ^ x] 



X -> uX.ip 



vlnd 



Fig. 3. Proof system P 



A formula is a theorem of a proof system if it is derivable from the 
axioms and rules of inference. An /^-formula ip is valid if for every belief 
model (J?, . . .) for G we have {ip} = Q. We now establish the soundness 
of the proof system P, that is, that its theorems are valid. 

Lemma 2. The proof system P is sound. 

Proof. We show the validity of the axiom ratDis: 

Let (J?, si, . . . , s n , Pi, . . . ,P n ) be a belief model for G. We must show 
that \rat^ — > — > O^x)] = f2. That is, that for any x the inclusion 
{rat^j n pxl C {O^xi nolds - So take some w G {rat^J n pxl- Then for 
every i £ [l..n], (pi(~s~i(u;) , G p.( w )) , and Pj(u;) C [x]. So by monotonicity 
of 0i, 0j(si(w),G[ x ]), i.e. a; G [O&x] as required. 

The axioms uDis and the rule z^Ind were introduced in [11] ; they for- 
malise, respectively, the following two consequences of Tarski's Fixpoint 
Theorem concerning a monotonic operator F: 



13 



— vF is a post-fixpoint of F, i.e., vF C F(vF) holds, 

- if Y is a post-fixpoint of F, i.e., Y C F(Y), then Y C zaF 

Next, we establish the already announced claim. 
Theorem 1. T/ie formula (FTP is a theorem of the proof system P. 

Proof. The following formulas are instances of the axioms ratDis (with 
tp : = □Vafy A rat^) and vDis (with ^ := D(X A rat^)) respectively: 

rat^ ->• (D(D*rat tt> A rat^) -> 0^(D*ra^ A rafy)), (2) 
□*rafy -> □ ((□*ra^) A rafy). (3) 

Putting these two together via some propositional logic, we obtain 

((□*rafy) A ra^) -> O ((D*ra^) A rat^), 

which is of the right shape to apply the rule vlnd (with x '■= ^rat^Arat^ 
and ip := O^X). We then obtain 

(D*rat 4> A rat^) ->• i/X.O^X, 

which is precisely the formula ([T]). 

Corollary 1. The formula (TTP zs valid. 

It is interesting to note that no axioms or rules for the modalities □ or 
O were needed in order to derive ([T|), other than those connecting them 
with rationality. In particular, no introspection is required on the part of 
the players, nor indeed is the K axiom 0(<p A tp) <-> A □?/>) needed. 

In the language C u , the rat^ are in effect propositional constants. 
We might instead define them in terms of the and Ofc modalities but 
to this end we would need to extend the language C u . One way to do 
this is to use a quantified modal language, allowing quantifiers over set 
variables, so extending L v by allowing formulas of the form yXip. Such 
quantified modal logics are studied in [10] . It is straightforward to extend 
the semantics to this larger class of formulas: 

p/Xtp | Ej = {u: € Q | VF C [2, W G \ip j F]}. 

In the resulting language each rat^. constant is definable by a formula of 
this second-order language: 

rat^ =yX(DiX ^O^X). (4) 

The following observation then shows correctness of this definition. 
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Note 5. For all i € [l..n] the formula Q is valid in the semantics sketched. 

To complete our proof-theoretic analysis we augment the proof system P 
with the following proof rule where we assume that x is positive in X, 
but where ip is an arbitrary z^-free /^-formula: 

x ->• i> 



vX.x — >■ vX.ip 



Incl 



The soundness of this rule is a direct consequence of Lemma (TJ 

To formalize the statement Imp we need two optimality conditions, 
gbri and Isdi. 

To link the proof systems for the languages Co and C v we add the 
following proof rule, where each </>j and ipi is an optimality condition in 
Co, and O^X — >■ 0^,X is a formula of C u . 

(pi ->ip i} ie [l..n] 



o^x c^x 



Link 



The soundness of this rule is a direct consequence of the semantics of the 
formulas O^X and O^X. 

We denote the system obtained from P by adding to it the above two 
proof rules and standard first-order logic rules concerning the formulas in 
the language Co, like 

3y" 



Vx 3y4> 

by R. We can now formalize the statement Imp as follows: 

{ratg br A U*rat gbr ) -> vx.O lad x. (5) 

The following result then shows that this formula can be formally 
derived in the considered proof system. 

Theorem 2. The formula (0) is a theorem of the proof system R. 

Proof. The properties gbri are monotonic, so the following implication is 
an instance of (fl|): 

(ratgb r A 0*ratgb r ) — > ux.OgbrX. 

Further, since the implication gbr^ — > Isdi holds, we get by the Link 
rule 

VX.Og br X ->■ UX.OlsdX, 

from which ([5]) follows. 

Corollary 2. TTie formula (0j is waZid. 
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6 Summary 



We have studied the logical form of epistemic characterisation results, 
for arbitrary (including infinite) strategic games, of the form "common 
knowledge of (^-rationality entails playing according to the iterated elim- 
ination of non-c// properties" . A main contribution of this work is in re- 
vealing, by giving syntactic proofs, the reasoning principles involved in 
two cases: firstly when (ft = 4>' (Theorem [T]), and secondly when cp entails 
eft' (Theorem [2]) . In each case the result holds when (f> is monotonic. The 
language C u that we used to formalise this reasoning is to our knowledge 
novel in combining optimality operators with fixpoint notions. Such a 
combination is natural when studying such characterisation results, since 
common knowledge and iterated elimination are both fixpoint notions. 

The language C u is parametric in the optimality conditions used by 
players. It is therefore built on the top of a first-order language Co used 
to define syntactically optimality conditions relevant for our analysis. 
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